Research on Segmentation and Labeling of Speech Corpora
Authors
Abstract
In this paper, we propose a Reference Sentence Alignment (RSA) method that segments and labels speech automatically, based on the multiple-pronunciation phoneme segmental k-means algorithm and HMM. Furthermore, the search path created by this method allows pitch and energy information to be obtained and labeled synchronously. This segmentation and labeling strategy was applied in our "863 National Project Chinese Mandarin Speech Corpora". An accuracy of more than 95% can be obtained.
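As a rough illustration of what aligning speech to a reference sentence involves, the sketch below runs a generic Viterbi forced alignment over a left-to-right phone sequence and then averages a per-frame feature (such as energy) over each resulting segment. It is not the RSA method itself: the abstract does not give its details, so the function names `forced_align` and `segment_means`, the one-state-per-phone topology, and the toy log-likelihood matrix are all assumptions made for illustration.

```python
import numpy as np

def forced_align(log_lik):
    """Align T frames to N reference phones (hypothetical helper).

    log_lik[t, j] is the log-likelihood of frame t under the j-th phone
    of the reference sentence. Each phone is a single state that may only
    repeat or advance to the next phone (an assumed, simplified topology).
    Returns a list of (start_frame, end_frame_exclusive), one per phone.
    """
    T, N = log_lik.shape
    if T < N:
        raise ValueError("need at least one frame per phone")
    score = np.full((T, N), -np.inf)
    advanced = np.zeros((T, N), dtype=bool)  # True if we entered phone j at frame t
    score[0, 0] = log_lik[0, 0]
    for t in range(1, T):
        for j in range(N):
            stay = score[t - 1, j]
            move = score[t - 1, j - 1] if j > 0 else -np.inf
            advanced[t, j] = move > stay
            score[t, j] = max(stay, move) + log_lik[t, j]
    # Trace back from the last phone at the last frame to recover boundaries.
    j = N - 1
    boundaries = [T]
    for t in range(T - 1, 0, -1):
        if advanced[t, j]:
            boundaries.append(t)
            j -= 1
    boundaries.append(0)
    boundaries.reverse()
    return [(boundaries[k], boundaries[k + 1]) for k in range(N)]

def segment_means(feature, segments):
    """Average a per-frame feature (e.g. energy) over each aligned segment."""
    feature = np.asarray(feature, dtype=float)
    return [float(feature[s:e].mean()) for s, e in segments]

# Toy data (assumed): 6 frames, 3 phones; frames 0-1 match phone 0,
# frames 2-4 match phone 1, frame 5 matches phone 2.
frame_phone = [0, 0, 1, 1, 1, 2]
log_lik = np.full((6, 3), -5.0)
for t, p in enumerate(frame_phone):
    log_lik[t, p] = -1.0

segments = forced_align(log_lik)
energy_per_phone = segment_means([1, 1, 2, 2, 2, 5], segments)
```

Because the segment boundaries come straight from the alignment path, any frame-synchronous measurement can be attached to the same labels without a second pass, which is the general idea behind labeling pitch and energy along the search path.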
Similar resources
Automatic Labeling of Corpora for Speech
One of the bottlenecks in the development of text-to-speech synthesizers based on segment concatenation is the need for large, segmented and labeled corpora. Consequently, as manual segmentation and labeling is a tedious and time consuming task, there is a strong demand for automatic labeling systems which can label speech from many languages. Several systems have been proposed already, but the...
Refined speech segmentation for concatenative speech synthesis
High accuracy phonetic segmentation is critical for achieving good quality in concatenative text to speech synthesis. Due to the shortcomings of current automated techniques based on HMM-based alignment or Dynamic Time Warping (DTW), manual verification and labeling are often required. In this paper we present a novel technique for automatic placement of phoneme boundaries in a speech waveform ...
Development of annotated Bangla speech corpora
This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and developing speech applications. Several criteria were maintained in the corpora development process that includes considering the phonetic and prosodic features during text selection. On the other hand, a specification was maintained in the recording phase as ...
Towards A Phoneme Labeled Mandarin Chinese Speech Corpus
Phoneme level transcription of speech corpora is crucial to fundamental speech research and the increasingly interested detection-based automatic speech recognition. Currently, there is no existing phoneme-labeled Mandarin Chinese speech corpus. This paper presents our recent work towards development of such a corpus. Our goal is to label five hours of speech data selected from a Mandarin Chine...